Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 31
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
G3 (Bethesda) ; 14(2)2024 Feb 07.
Artigo em Inglês | MEDLINE | ID: mdl-38066578

RESUMO

Pigeons and doves (family Columbidae) are one of the most diverse extant avian lineages, and many species have served as key models for evolutionary genomics, developmental biology, physiology, and behavioral studies. Building genomic resources for columbids is essential to further many of these studies. Here, we present high-quality genome assemblies and annotations for 2 columbid species, Columba livia and Columba guinea. We simultaneously assembled C. livia and C. guinea genomes from long-read sequencing of a single F1 hybrid individual. The new C. livia genome assembly (Cliv_3) shows improved completeness and contiguity relative to Cliv_2.1, with an annotation incorporating long-read IsoSeq data for more accurate gene models. Intensive selective breeding of C. livia has given rise to hundreds of breeds with diverse morphological and behavioral characteristics, and Cliv_3 offers improved tools for mapping the genomic architecture of interesting traits. The C. guinea genome assembly is the first for this species and is a new resource for avian comparative genomics. Together, these assemblies and annotations provide improved resources for functional studies of columbids and avian comparative genomics in general.


Assuntos
Columbidae , Genoma , Animais , Columbidae/genética , Guiné , Evolução Biológica
2.
bioRxiv ; 2023 Oct 14.
Artigo em Inglês | MEDLINE | ID: mdl-37873124

RESUMO

Pigeons and doves (family Columbidae) are one of the most diverse extant avian lineages, and many species have served as key models for evolutionary genomics, developmental biology, physiology, and behavioral studies. Building genomic resources for colubids is essential to further many of these studies. Here, we present high-quality genome assemblies and annotations for two columbid species, Columba livia and C. guinea. We simultaneously assembled C. livia and C. guinea genomes from long-read sequencing of a single F1 hybrid individual. The new C. livia genome assembly (Cliv_3) shows improved completeness and contiguity relative to Cliv_2.1, with an annotation incorporating long-read IsoSeq data for more accurate gene models. Intensive selective breeding of C. livia has given rise to hundreds of breeds with diverse morphological and behavioral characteristics, and Cliv_3 offers improved tools for mapping the genomic architecture of interesting traits. The C. guinea genome assembly is the first for this species and is a new resource for avian comparative genomics. Together, these assemblies and annotations provide improved resources for functional studies of columbids and avian comparative genomics in general.

3.
Cell Rep ; 42(3): 112263, 2023 03 28.
Artigo em Inglês | MEDLINE | ID: mdl-36930644

RESUMO

Programmed DNA loss is a gene silencing mechanism that is employed by several vertebrate and nonvertebrate lineages, including all living jawless vertebrates and songbirds. Reconstructing the evolution of somatically eliminated (germline-specific) sequences in these species has proven challenging due to a high content of repeats and gene duplications in eliminated sequences and a corresponding lack of highly accurate and contiguous assemblies for these regions. Here, we present an improved assembly of the sea lamprey (Petromyzon marinus) genome that was generated using recently standardized methods that increase the contiguity and accuracy of vertebrate genome assemblies. This assembly resolves highly contiguous, somatically retained chromosomes and at least one germline-specific chromosome, permitting new analyses that reconstruct the timing, mode, and repercussions of recruitment of genes to the germline-specific fraction. These analyses reveal major roles of interchromosomal segmental duplication, intrachromosomal duplication, and positive selection for germline functions in the long-term evolution of germline-specific chromosomes.


Assuntos
Petromyzon , Animais , Petromyzon/genética , Cromossomos/genética , DNA/genética , Genoma , Vertebrados/genética , Células Germinativas , Evolução Molecular , Filogenia
4.
Artigo em Inglês | MEDLINE | ID: mdl-36379720

RESUMO

Congenital myasthenic syndrome (CMS) is a group of 32 disorders involving genetic dysfunction at the neuromuscular junction resulting in skeletal muscle weakness that worsens with physical activity. Precise diagnosis and molecular subtype identification are critical for treatment as medication for one subtype may exacerbate disease in another (Engel et al., Lancet Neurol 14: 420 [2015]; Finsterer, Orphanet J Rare Dis 14: 57 [2019]; Prior and Ghosh, J Child Neurol 36: 610 [2021]). The SNAP25-related CMS subtype (congenital myasthenic syndrome 18, CMS18; MIM #616330) is a rare disorder characterized by muscle fatigability, delayed psychomotor development, and ataxia. Herein, we performed rapid whole-genome sequencing (rWGS) on a critically ill newborn leading to the discovery of an unreported pathogenic de novo SNAP25 c.529C > T; p.Gln177Ter variant. In this report, we present a novel case of CMS18 with complex neonatal consequence. This discovery offers unique insight into the extent of phenotypic severity in CMS18, expands the reported SNAP25 variant phenotype, and paves a foundation for personalized management for CMS18.


Assuntos
Síndromes Miastênicas Congênitas , Humanos , Mapeamento Cromossômico , Síndromes Miastênicas Congênitas/diagnóstico , Síndromes Miastênicas Congênitas/genética , Linhagem , Fenótipo , Proteína 25 Associada a Sinaptossoma/genética , Sequenciamento Completo do Genoma
5.
Mol Genet Genomic Med ; 10(4): e1888, 2022 04.
Artigo em Inglês | MEDLINE | ID: mdl-35119225

RESUMO

BACKGROUND: Genetic disorders contribute to significant morbidity and mortality in critically ill newborns. Despite advances in genome sequencing technologies, a majority of neonatal cases remain unsolved. Complex structural variants (SVs) often elude conventional genome sequencing variant calling pipelines and will explain a portion of these unsolved cases. METHODS: As part of the Utah NeoSeq project, we used a research-based, rapid whole-genome sequencing (WGS) protocol to investigate the genomic etiology for a newborn with a left-sided congenital diaphragmatic hernia (CDH) and cardiac malformations, whose mother also had a history of CDH and atrial septal defect. RESULTS: Using both a novel, alignment-free and traditional alignment-based variant callers, we identified a maternally inherited complex SV on chromosome 8, consisting of an inversion flanked by deletions. This complex inversion, further confirmed using orthogonal molecular techniques, disrupts the ZFPM2 gene, which is associated with both CDH and various congenital heart defects. CONCLUSIONS: Our results demonstrate that complex structural events, which often are unidentifiable or not reported by clinically validated testing procedures, can be discovered and accurately characterized with conventional, short-read sequencing and underscore the utility of WGS as a first-line diagnostic tool.


Assuntos
Hérnias Diafragmáticas Congênitas , Proteínas de Ligação a DNA/genética , Genômica , Hérnias Diafragmáticas Congênitas/genética , Humanos , Recém-Nascido , Fatores de Transcrição/genética , Sequenciamento Completo do Genoma/métodos
6.
Curr Biol ; 31(22): 5069-5076.e5, 2021 11 22.
Artigo em Inglês | MEDLINE | ID: mdl-34551284

RESUMO

Vertebrate craniofacial morphogenesis is a highly orchestrated process that is directed by evolutionarily conserved developmental pathways.1,2 Within species, canalized development typically produces modest morphological variation. However, as a result of millennia of artificial selection, the domestic pigeon displays radical craniofacial variation within a single species. One of the most striking cases of pigeon craniofacial variation is the short-beak phenotype, which has been selected in numerous breeds. Classical genetic experiments suggest that pigeon beak length is regulated by a small number of genetic factors, one of which is sex linked (Ku2 locus).3-5 However, the genetic underpinnings of pigeon craniofacial variation remain unknown. Using geometric morphometrics and quantitative trait locus (QTL) mapping on an F2 intercross between a short-beaked Old German Owl (OGO) and a medium-beaked Racing Homer (RH), we identified a single Z chromosome locus that explains a majority of the variation in beak morphology in the F2 population. Complementary comparative genomic analyses revealed that the same locus is strongly differentiated between breeds with short and medium beaks. Within the Ku2 locus, we identified an amino acid substitution in the non-canonical Wnt receptor ROR2 as a putative regulator of pigeon beak length. The non-canonical Wnt pathway serves critical roles in vertebrate neural crest cell migration and craniofacial morphogenesis.6,7 In humans, ROR2 mutations cause Robinow syndrome, a congenital disorder characterized by skeletal abnormalities, including a widened and shortened facial skeleton.8,9 Our results illustrate how the extraordinary craniofacial variation among pigeons can reveal genetic regulators of vertebrate craniofacial diversity.


Assuntos
Anormalidades Craniofaciais , Nanismo , Deformidades Congênitas dos Membros , Anormalidades Urogenitais , Animais , Columbidae/genética , Anormalidades Craniofaciais/genética , Nanismo/genética , Deformidades Congênitas dos Membros/genética , Anormalidades Urogenitais/genética
7.
Mol Biol Evol ; 38(12): 5376-5390, 2021 12 09.
Artigo em Inglês | MEDLINE | ID: mdl-34459920

RESUMO

The iris of the eye shows striking color variation across vertebrate species, and may play important roles in crypsis and communication. The domestic pigeon (Columba livia) has three common iris colors, orange, pearl (white), and bull (dark brown), segregating in a single species, thereby providing a unique opportunity to identify the genetic basis of iris coloration. We used comparative genomics and genetic mapping in laboratory crosses to identify two candidate genes that control variation in iris color in domestic pigeons. We identified a nonsense mutation in the solute carrier SLC2A11B that is shared among all pigeons with pearl eye color, and a locus associated with bull eye color that includes EDNRB2, a gene involved in neural crest migration and pigment development. However, bull eye is likely controlled by a heterogeneous collection of alleles across pigeon breeds. We also found that the EDNRB2 region is associated with regionalized plumage depigmentation (piebalding). Our study identifies two candidate genes for eye colors variation, and establishes a genetic link between iris and plumage color, two traits that vary widely in the evolution of birds and other vertebrates.


Assuntos
Columbidae , Cor de Olho , Alelos , Animais , Bovinos , Columbidae/genética , Cor de Olho/genética , Genômica , Masculino , Melhoramento Vegetal
8.
Nat Genet ; 50(11): 1617, 2018 11.
Artigo em Inglês | MEDLINE | ID: mdl-30224652

RESUMO

When published, this article did not initially appear open access. This error has been corrected, and the open access status of the paper is noted in all versions of the paper. Additionally, affiliation 16 denoting equal contribution was missing from author Robb Krumlauf in the PDF originally published. This error has also been corrected.

10.
Bioinformatics ; 34(21): 3616-3623, 2018 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-29701825

RESUMO

Motivation: Genetic variation that disrupts gene function by altering gene splicing between individuals can substantially influence traits and disease. In those cases, accurately predicting the effects of genetic variation on splicing can be highly valuable for investigating the mechanisms underlying those traits and diseases. While methods have been developed to generate high quality computational predictions of gene structures in reference genomes, the same methods perform poorly when used to predict the potentially deleterious effects of genetic changes that alter gene splicing between individuals. Underlying that discrepancy in predictive ability are the common assumptions by reference gene finding algorithms that genes are conserved, well-formed and produce functional proteins. Results: We describe a probabilistic approach for predicting recent changes to gene structure that may or may not conserve function. The model is applicable to both coding and non-coding genes, and can be trained on existing gene annotations without requiring curated examples of aberrant splicing. We apply this model to the problem of predicting altered splicing patterns in the genomes of individual humans, and we demonstrate that performing gene-structure prediction without relying on conserved coding features is feasible. The model predicts an unexpected abundance of variants that create de novo splice sites, an observation supported by both simulations and empirical data from RNA-seq experiments. While these de novo splice variants are commonly misinterpreted by other tools as coding or non-coding variants of little or no effect, we find that in some cases they can have large effects on splicing activity and protein products and we propose that they may commonly act as cryptic factors in disease. Availability and implementation: The software is available from geneprediction.org/SGRF. Supplementary information: Supplementary information is available at Bioinformatics online.


Assuntos
Éxons , Splicing de RNA , Software , Humanos , Anotação de Sequência Molecular , Análise de Sequência de RNA
11.
G3 (Bethesda) ; 8(5): 1391-1398, 2018 05 04.
Artigo em Inglês | MEDLINE | ID: mdl-29519939

RESUMO

The domestic rock pigeon (Columba livia) is among the most widely distributed and phenotypically diverse avian species. C. livia is broadly studied in ecology, genetics, physiology, behavior, and evolutionary biology, and has recently emerged as a model for understanding the molecular basis of anatomical diversity, the magnetic sense, and other key aspects of avian biology. Here we report an update to the C. livia genome reference assembly and gene annotation dataset. Greatly increased scaffold lengths in the updated reference assembly, along with an updated annotation set, provide improved tools for evolutionary and functional genetic studies of the pigeon, and for comparative avian genomics in general.


Assuntos
Columbidae/genética , Genoma , Anotação de Sequência Molecular , Animais , Mapeamento Cromossômico , Elementos de DNA Transponíveis/genética , Biblioteca Gênica , Marcadores Genéticos , Genótipo , Sintenia/genética , Transcriptoma/genética
12.
Nat Genet ; 50(2): 270-277, 2018 02.
Artigo em Inglês | MEDLINE | ID: mdl-29358652

RESUMO

The sea lamprey (Petromyzon marinus) serves as a comparative model for reconstructing vertebrate evolution. To enable more informed analyses, we developed a new assembly of the lamprey germline genome that integrates several complementary data sets. Analysis of this highly contiguous (chromosome-scale) assembly shows that both chromosomal and whole-genome duplications have played significant roles in the evolution of ancestral vertebrate and lamprey genomes, including chromosomes that carry the six lamprey HOX clusters. The assembly also contains several hundred genes that are reproducibly eliminated from somatic cells during early development in lamprey. Comparative analyses show that gnathostome (mouse) homologs of these genes are frequently marked by polycomb repressive complexes (PRCs) in embryonic stem cells, suggesting overlaps in the regulatory logic of somatic DNA elimination and bivalent states that are regulated by early embryonic PRCs. This new assembly will enhance diverse studies that are informed by lampreys' unique biology and evolutionary/comparative perspective.


Assuntos
Reprogramação Celular/genética , Evolução Molecular , Genoma , Células Germinativas/metabolismo , Mutagênese/fisiologia , Petromyzon/genética , Vertebrados/genética , Animais , Montagem e Desmontagem da Cromatina/genética , Vertebrados/classificação
13.
Nucleic Acids Res ; 46(6): e32, 2018 04 06.
Artigo em Inglês | MEDLINE | ID: mdl-29294048

RESUMO

High-throughput sequencing data are increasingly being made available to the research community for secondary analyses, providing new opportunities for large-scale association studies. However, heterogeneity in target capture and sequencing technologies often introduce strong technological stratification biases that overwhelm subtle signals of association in studies of complex traits. Here, we introduce the Cross-Platform Association Toolkit, XPAT, which provides a suite of tools designed to support and conduct large-scale association studies with heterogeneous sequencing datasets. XPAT includes tools to support cross-platform aware variant calling, quality control filtering, gene-based association testing and rare variant effect size estimation. To evaluate the performance of XPAT, we conducted case-control association studies for three diseases, including 783 breast cancer cases, 272 ovarian cancer cases, 205 Crohn disease cases and 3507 shared controls (including 1722 females) using sequencing data from multiple sources. XPAT greatly reduced Type I error inflation in the case-control analyses, while replicating many previously identified disease-gene associations. We also show that association tests conducted with XPAT using cross-platform data have comparable performance to tests using matched platform data. XPAT enables new association studies that combine existing sequencing datasets to identify genetic loci associated with common diseases and other complex traits.


Assuntos
Biologia Computacional/métodos , Estudo de Associação Genômica Ampla/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Polimorfismo de Nucleotídeo Único , Algoritmos , Neoplasias da Mama/genética , Estudos de Casos e Controles , Doença de Crohn/genética , Feminino , Predisposição Genética para Doença/genética , Humanos , Masculino , Neoplasias Ovarianas/genética , Software
14.
G3 (Bethesda) ; 7(9): 3157-3167, 2017 09 07.
Artigo em Inglês | MEDLINE | ID: mdl-28751502

RESUMO

A reference genome sequence for Pseudotsuga menziesii var. menziesii (Mirb.) Franco (Coastal Douglas-fir) is reported, thus providing a reference sequence for a third genus of the family Pinaceae. The contiguity and quality of the genome assembly far exceeds that of other conifer reference genome sequences (contig N50 = 44,136 bp and scaffold N50 = 340,704 bp). Incremental improvements in sequencing and assembly technologies are in part responsible for the higher quality reference genome, but it may also be due to a slightly lower exact repeat content in Douglas-fir vs. pine and spruce. Comparative genome annotation with angiosperm species reveals gene-family expansion and contraction in Douglas-fir and other conifers which may account for some of the major morphological and physiological differences between the two major plant groups. Notable differences in the size of the NDH-complex gene family and genes underlying the functional basis of shade tolerance/intolerance were observed. This reference genome sequence not only provides an important resource for Douglas-fir breeders and geneticists but also sheds additional light on the evolutionary processes that have led to the divergence of modern angiosperms from the more ancient gymnosperms.


Assuntos
Genoma de Planta , Fotossíntese/genética , Pinaceae/genética , Pinaceae/metabolismo , Pseudotsuga/genética , Pseudotsuga/metabolismo , Sequenciamento Completo do Genoma , Adaptação Biológica/genética , Biologia Computacional , Evolução Molecular , Duplicação Gênica , Redes Reguladoras de Genes , Genômica , Anotação de Sequência Molecular , Família Multigênica , Filogenia , Pinaceae/classificação , Proteômica/métodos , Pseudotsuga/classificação , Sequências Repetitivas de Ácido Nucleico
15.
Bioinformatics ; 33(10): 1437-1446, 2017 May 15.
Artigo em Inglês | MEDLINE | ID: mdl-28011790

RESUMO

MOTIVATION: The accurate interpretation of genetic variants is critical for characterizing genotype-phenotype associations. Because the effects of genetic variants can depend strongly on their local genomic context, accurate genome annotations are essential. Furthermore, as some variants have the potential to disrupt or alter gene structure, variant interpretation efforts stand to gain from the use of individualized annotations that account for differences in gene structure between individuals or strains. RESULTS: We describe a suite of software tools for identifying possible functional changes in gene structure that may result from sequence variants. ACE ('Assessing Changes to Exons') converts phased genotype calls to a collection of explicit haplotype sequences, maps transcript annotations onto them, detects gene-structure changes and their possible repercussions, and identifies several classes of possible loss of function. Novel transcripts predicted by ACE are commonly supported by spliced RNA-seq reads, and can be used to improve read alignment and transcript quantification when an individual-specific genome sequence is available. Using publicly available RNA-seq data, we show that ACE predictions confirm earlier results regarding the quantitative effects of nonsense-mediated decay, and we show that predicted loss-of-function events are highly concordant with patterns of intolerance to mutations across the human population. ACE can be readily applied to diverse species including animals and plants, making it a broadly useful tool for use in eukaryotic population-based resequencing projects, particularly for assessing the joint impact of all variants at a locus. AVAILABILITY AND IMPLEMENTATION: ACE is written in open-source C ++ and Perl and is available from geneprediction.org/ACE. CONTACT: myandell@genetics.utah.edu or tim.reddy@duke.edu. SUPPLEMENTARY INFORMATION: Supplementary information is available at Bioinformatics online.


Assuntos
Genômica/métodos , Polimorfismo Genético , Análise de Sequência de RNA/métodos , Software , Animais , Eucariotos/genética , Éxons , Haplótipos , Humanos , Mutação , Splicing de RNA
16.
Science ; 348(6239): 1139-43, 2015 Jun 05.
Artigo em Inglês | MEDLINE | ID: mdl-25977371

RESUMO

The evolution of eusociality is one of the major transitions in evolution, but the underlying genomic changes are unknown. We compared the genomes of 10 bee species that vary in social complexity, representing multiple independent transitions in social evolution, and report three major findings. First, many important genes show evidence of neutral evolution as a consequence of relaxed selection with increasing social complexity. Second, there is no single road map to eusociality; independent evolutionary transitions in sociality have independent genetic underpinnings. Third, though clearly independent in detail, these transitions do have similar general features, including an increase in constrained protein evolution accompanied by increases in the potential for gene regulation and decreases in diversity and abundance of transposable elements. Eusociality may arise through different mechanisms each time, but would likely always involve an increase in the complexity of gene networks.


Assuntos
Abelhas/genética , Evolução Molecular , Deriva Genética , Comportamento Social , Transcriptoma , Aminoácido N-Acetiltransferase , Animais , Abelhas/classificação , Elementos de DNA Transponíveis , Regulação da Expressão Gênica , Redes Reguladoras de Genes , Genoma de Inseto/genética , Filogenia , Seleção Genética , Fatores de Transcrição/química , Fatores de Transcrição/genética
17.
Plant Physiol ; 167(1): 25-39, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-25384563

RESUMO

The large size and relative complexity of many plant genomes make creation, quality control, and dissemination of high-quality gene structure annotations challenging. In response, we have developed MAKER-P, a fast and easy-to-use genome annotation engine for plants. Here, we report the use of MAKER-P to update and revise the maize (Zea mays) B73 RefGen_v3 annotation build (5b+) in less than 3 h using the iPlant Cyberinfrastructure. MAKER-P identified and annotated 4,466 additional, well-supported protein-coding genes not present in the 5b+ annotation build, added additional untranslated regions to 1,393 5b+ gene models, identified 2,647 5b+ gene models that lack any supporting evidence (despite the use of large and diverse evidence data sets), identified 104,215 pseudogene fragments, and created an additional 2,522 noncoding gene annotations. We also describe a method for de novo training of MAKER-P for the annotation of newly sequenced grass genomes. Collectively, these results lead to the 6a maize genome annotation and demonstrate the utility of MAKER-P for rapid annotation, management, and quality control of grasses and other difficult-to-annotate plant genomes.


Assuntos
Genes de Plantas/genética , Genoma de Planta/genética , Anotação de Sequência Molecular/métodos , Zea mays/genética , Bases de Dados Genéticas/normas , Éxons/genética , Íntrons/genética , Modelos Genéticos , Anotação de Sequência Molecular/normas , Pseudogenes/genética , Controle de Qualidade , RNA não Traduzido/genética
18.
Curr Protoc Bioinformatics ; 48: 4.11.1-4.11.39, 2014 Dec 12.
Artigo em Inglês | MEDLINE | ID: mdl-25501943

RESUMO

This unit describes how to use the genome annotation and curation tools MAKER and MAKER-P to annotate protein-coding and noncoding RNA genes in newly assembled genomes, update/combine legacy annotations in light of new evidence, add quality metrics to annotations from other pipelines, and map existing annotations to a new assembly. MAKER and MAKER-P can rapidly annotate genomes of any size, and scale to match available computational resources.


Assuntos
Genoma Humano , Software , Humanos
19.
Genome Biol ; 15(3): R59, 2014 Mar 04.
Artigo em Inglês | MEDLINE | ID: mdl-24647006

RESUMO

BACKGROUND: The size and complexity of conifer genomes has, until now, prevented full genome sequencing and assembly. The large research community and economic importance of loblolly pine, Pinus taeda L., made it an early candidate for reference sequence determination. RESULTS: We develop a novel strategy to sequence the genome of loblolly pine that combines unique aspects of pine reproductive biology and genome assembly methodology. We use a whole genome shotgun approach relying primarily on next generation sequence generated from a single haploid seed megagametophyte from a loblolly pine tree, 20-1010, that has been used in industrial forest tree breeding. The resulting sequence and assembly was used to generate a draft genome spanning 23.2 Gbp and containing 20.1 Gbp with an N50 scaffold size of 66.9 kbp, making it a significant improvement over available conifer genomes. The long scaffold lengths allow the annotation of 50,172 gene models with intron lengths averaging over 2.7 kbp and sometimes exceeding 100 kbp in length. Analysis of orthologous gene sets identifies gene families that may be unique to conifers. We further characterize and expand the existing repeat library based on the de novo analysis of the repetitive content, estimated to encompass 82% of the genome. CONCLUSIONS: In addition to its value as a resource for researchers and breeders, the loblolly pine genome sequence and assembly reported here demonstrates a novel approach to sequencing the large and complex genomes of this important group of plants that can now be widely applied.


Assuntos
Mapeamento de Sequências Contíguas/métodos , Genoma de Planta , Pinus taeda/genética , Análise de Sequência de DNA/métodos , DNA de Plantas/genética , Haploidia
20.
Genetics ; 196(3): 891-909, 2014 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-24653211

RESUMO

The largest genus in the conifer family Pinaceae is Pinus, with over 100 species. The size and complexity of their genomes (∼20-40 Gb, 2n = 24) have delayed the arrival of a well-annotated reference sequence. In this study, we present the annotation of the first whole-genome shotgun assembly of loblolly pine (Pinus taeda L.), which comprises 20.1 Gb of sequence. The MAKER-P annotation pipeline combined evidence-based alignments and ab initio predictions to generate 50,172 gene models, of which 15,653 are classified as high confidence. Clustering these gene models with 13 other plant species resulted in 20,646 gene families, of which 1554 are predicted to be unique to conifers. Among the conifer gene families, 159 are composed exclusively of loblolly pine members. The gene models for loblolly pine have the highest median and mean intron lengths of 24 fully sequenced plant genomes. Conifer genomes are full of repetitive DNA, with the most significant contributions from long-terminal-repeat retrotransposons. In depth analysis of the tandem and interspersed repetitive content yielded a combined estimate of 82%.


Assuntos
Genoma de Planta , Anotação de Sequência Molecular/métodos , Pinus taeda/genética , DNA de Plantas/análise , Evolução Molecular , Genes de Plantas , Família Multigênica , Filogenia , Alinhamento de Sequência
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...